Approximate Nearest Neighbor Algorithms for Hausdorff Metrics via Embeddings

نویسندگان

  • Martin Farach-Colton
  • Piotr Indyk
چکیده

Hausdorff metrics are used in geometric settings for measuring the distance between sets of points. They have been used extensively in areas such as computer vision, pattern recognition and computational chemistry. While computing the distance between a single pair of sets under the Hausdorff metric has been well studied, no results were known for the Nearest Neighbor problem under Hausdorff metrics. Indeed, no results were known for the nearest neighbor problem for any metric without norm structure, of which the Hausdorff is one. We present the first nearest neighbor algorithm for the Hausdorff metric. We achieve our result by embedding Hausdorff metrics into l∞ and using known nearest neighbor algorithms for this target metric. We give upper and lower bounds on the number of dimensions needed for such an l∞ embedding. Our bounds require the introduction of new techniques based on superimposed codes and non-uniform sampling. ∗Department of Computer Science, Rutgers University, Piscataway, NJ 08855, USA. ([email protected], http://www.cs.rutgers.edu/∼farach). Supported by NSF Career Development Award CCR-9501942, NATO Grant CRG 960215 and an Alfred P. Sloan Research Fellowship. †Supported by Stanford Graduate Fellowship and ARO MURI Grant DAAH04–96–1–0007. Part of this work was done while the author was visiting Bell Laboratories.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Approximate Nearest Neighbor Algorithms for Hausdorr Metrics via Embeddings

Hausdorr metrics are used in geometric settings for measuring the distance between sets of points. They have been used extensively in areas such as computer vision , pattern recognition and computational chemistry. While computing the distance between a single pair of sets under the Hausdorr metric has been well studied, no results were known for the Nearest Neighbor problem under Hausdorr metr...

متن کامل

Approximate nearest neighbor search for $\ell_p$-spaces ($2<p<\infty$) via embeddings

While the problem of approximate nearest neighbor search has been well-studied for Eu-clidean space and ℓ 1 , few non-trivial algorithms are known for ℓ p when 2 < p < ∞. In this paper, we revisit this fundamental problem and present approximate nearest-neighbor search algorithms which give the first non-trivial approximation factor guarantees in this setting.

متن کامل

Fast Nearest Neighbor Preserving Embeddings

We show an analog to the Fast Johnson-Lindenstrauss Transform for Nearest Neighbor Preserving Embeddings in `2. These are sparse, randomized embeddings that preserve the (approximate) nearest neighbors. The dimensionality of the embedding space is bounded not by the size of the embedded set n, but by its doubling dimension λ. For most large real-world datasets this will mean a considerably lowe...

متن کامل

Software Cost Estimation by a New Hybrid Model of Particle Swarm Optimization and K-Nearest Neighbor Algorithms

A successful software should be finalized with determined and predetermined cost and time. Software is a production which its approximate cost is expert workforce and professionals. The most important and approximate software cost estimation (SCE) is related to the trained workforce. Creative nature of software projects and its abstract nature make extremely cost and time of projects difficult ...

متن کامل

Partitioning Well-Clustered Graphs: Spectral Clustering Works!

In this work we study the widely used spectral clustering algorithms, i.e. partition a graph into k clusters via (1) embedding the vertices of a graph into a low-dimensional space using the bottom eigenvectors of the Laplacian matrix, and (2) partitioning the embedded points via k-means algorithms. We show that, for a wide class of graphs, spectral clustering algorithms give a good approximatio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999